Adjusting the Frame: Biphasic Performative Control of Speech Rhythm
نویسندگان
چکیده
Performative time and pitch scaling is a new research paradigm for prosodic analysis by synthesis. In this paper, a system for real-time recorded speech time and pitch scaling by the means of hands or feet gestures is designed and evaluated. Pitch is controlled with the preferred hand, using a stylus on a graphic tablet. Time is controlled using rhythmic frames, or constriction gestures, defined by pairs of control points. The ”Arsis” corresponds to the constriction (weak beat of the syllable) and the ”Thesis” corresponds to the vocalic nucleus (strong beat of the syllable). This biphasic control of rhythmic units is performed by the non-preferred hand using a button. Pitch and time scales are modified according to these gestural controls with the help of a real-time pitch synchronous overlap-add technique (RT-PSOLA). Rhythm and pitch control accuracy are assessed in a prosodic imitation experiment: the task is to reproduce intonation and rhythm of various sentences. The results show that inter-vocalic durations differ on average of only 20 ms. The system appears as a new and effective tool for performative speech and singing synthesis. Consequences and applications in speech prosody research are discussed.
منابع مشابه
Vokinesis: syllabic control points for performative singing synthesis
Performative control of voice is the process of real-time speech synthesis or modification by the means of hands or feet gestures. Vokinesis, a system for real-time rhythm and pitch modification and control of singing is presented. Pitch and vocal effort are controlled by a stylus on a graphic tablet. The concept of Syllabic Control Points (SCP) is introduced for timing and rhythm control. A ch...
متن کاملCreating an individual speech rhythm: a data driven approach
Generating a near-to-natural speech rhythm can greatly contribute to the user's acceptance of TTS systems. Beside common aspects of the rhythm control (correctness of the segmental durations, robust function, etc.) rhythmic flexibility for several applications and individual speaking styles are desired. This article describes a data driven concept, which aims at the generation of an individual ...
متن کاملMAGE 2.0: New Features and its Application in the Development of a Talking Guitar
This paper describes the recent progress in our approach to generate performative and controllable speech. The goal of the performative HMM-based speech and singing synthesis library, called Mage, is to have the ability to generate natural sounding speech with arbitrary speaker’s voice characteristics, speaking styles and expressions and at the same time to have accurate reactive user control o...
متن کاملPerformative faces
The paper presents a model for the construction of an artificial agent that can express performatives through facial expression. The performative of a speech act or communicative act is the particular communicative intention a Sender has to one's Addressee, the way one wants to socially relate oneself to the interlocutor. Performatives are decomposed both on the meaning and on the signal side: ...
متن کاملHereby explained: an event-based account of performative utterances
Several authors propose that performative speech acts are self-guaranteeing due to their self-referential nature (Searle 1989; Jary 2007). The present paper offers an analysis of self-referentiality in terms of truth conditional semantics, making use of Davidsonian events. I propose that hereby can denote the ongoing act of information transfer (more mundanely, the utterance) which thereby ente...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017